Exploratory Analysis of table place_name¶

/home/kleiolab/Desktop/toolkit/src/file.py:58: DtypeWarning:

Columns (5,7,8,16,17) have mixed types. Specify dtype option on import or set low_memory=False.
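
The warning above can usually be silenced by declaring the types of the affected columns at import time. If the file's column order matches the extract below, positions 5, 7, 8, 16 and 17 correspond to complement_number, comment_begin_year, comment_end_year, begin_date and end_date. The following is a minimal sketch under that assumption; the in-memory CSV, its "bis" value, and the helper name `load_place_names` are hypothetical stand-ins, since the real file and loader are not shown in this report.

```python
import io

import pandas as pd

# Hypothetical miniature of the export (a subset of columns, invented second row).
SAMPLE_CSV = (
    "pk_named_place_name,name,complement_number,comment_begin_year,begin_date\n"
    "152213,Hellemmes,,,\n"
    "90593,Sicile (le),bis,fondation,2016-12-15\n"
)

# Columns assumed to be the ones flagged by the DtypeWarning.
TEXT_COLUMNS = ["complement_number", "comment_begin_year", "begin_date"]


def load_place_names(source):
    """Read the CSV, forcing the mixed-type columns to a nullable string dtype."""
    return pd.read_csv(source, dtype={col: "string" for col in TEXT_COLUMNS})


df = load_place_names(io.StringIO(SAMPLE_CSV))
print(df.dtypes)
```

Passing `low_memory=False` instead, as the warning suggests, also removes the message, but it leaves type inference to pandas rather than stating the intended types.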

Table extract¶

Shape:  (128529, 18)
(index) | pk_named_place_name | concat_plna | name | is_standard_name | number | complement_number | lang_iso | comment_begin_year | comment_end_year | notes | fk_named_place | creator | creation_time | modifier | modification_time | fk_abob_napl_name_type | begin_date | end_date
0 | 152213.0 | PlNa152213 | Hellemmes | True | NaN | NaN | NaN | NaN | NaN | NaN | 120967.0 | 4.0 | 2012-07-13 09:33:10.300 | 4.0 | NaN | NaN | NaN | NaN
1 | 152242.0 | PlNa152242 | Espagne | True | NaN | NaN | NaN | NaN | NaN | NaN | 48.0 | 4.0 | 2012-09-10 14:39:36.470 | 4.0 | NaN | NaN | NaN | NaN
2 | 152214.0 | PlNa152214 | Col de Parmnie | True | NaN | NaN | NaN | NaN | NaN | NaN | 90320.0 | 4.0 | 2012-07-13 09:37:12.680 | 4.0 | 2012-07-13 09:50:09 | NaN | NaN | NaN
3 | 152215.0 | PlNa152215 | Chile | False | NaN | NaN | spa | NaN | NaN | NaN | 121167.0 | 3.0 | 2012-07-30 10:29:41.330 | 3.0 | NaN | NaN | NaN | NaN
4 | 90593.0 | PlNa90593 | Sicile (le) | True | NaN | NaN | fra | NaN | NaN | NaN | 90270.0 | 3.0 | 2010-06-21 08:19:56.000 | 3.0 | 2012-08-03 10:30:30 | NaN | NaN | NaN

Discovery¶

Columns contain:
Total number of rows: 128529
  -    "pk_named_place_name":   0.00% empty - 128529 (100.00%) uniques (eg: 152213.0; 152242.0; 152214.0)
  -            "concat_plna":   0.00% empty - 128529 (100.00%) uniques (eg: PlNa152213; PlNa152242; PlNa152214)
  -       "is_standard_name":   0.00% empty -      2 (  0.00%) uniques (eg: True; False)
  -         "fk_named_place":   0.00% empty - 127466 ( 99.17%) uniques (eg: 120967.0; 48.0; 90320.0)
  -          "creation_time":   0.00% empty -   4635 (  3.61%) uniques (eg: 2012-07-13...; 2012-09-10...; 2012-07-13...)
  -                "creator":   0.00% empty -     50 (  0.04%) uniques (eg: 4.0; 3.0; 2.0)
  -               "modifier":   0.00% empty -     49 (  0.04%) uniques (eg: 4.0; 3.0; 2.0)
  -                   "name":   0.04% empty -  85361 ( 66.41%) uniques (eg: Hellemmes; Espagne; Col de Par...)
  -      "modification_time":   1.24% empty -    747 (  0.58%) uniques (eg: nan; 2012-07-13...; 2012-08-03...)
  -                  "notes":  63.97% empty -    769 (  0.60%) uniques (eg: nan; Nom donn a...; Nom offici...)
  -               "lang_iso":  70.23% empty -     30 (  0.02%) uniques (eg: nan; spa; fra)
  -                 "number":  76.15% empty -    532 (  0.41%) uniques (eg: nan; 1.0; 4.0)
  -      "complement_number":  81.62% empty -     38 (  0.03%) uniques (eg: nan;    ;   )
  - "fk_abob_napl_name_type":  95.18% empty -      3 (  0.00%) uniques (eg: nan; 1253.0; 697.0)
  -             "begin_date":  96.52% empty -    113 (  0.09%) uniques (eg: nan; 2016-12-15; 2019-01-01)
  -     "comment_begin_year":  96.52% empty -     23 (  0.02%) uniques (eg: nan; fondation ; autour de)
  -       "comment_end_year":  96.52% empty -     36 (  0.03%) uniques (eg: nan; devient Ma...; reconstitu...)
  -               "end_date":  96.53% empty -    178 (  0.14%) uniques (eg: nan; 1849-06-20; 1946-11-14)
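
A summary like the one above can be computed directly with pandas. The sketch below is one possible way to do it, not the toolkit's actual code: the function name `discover` is hypothetical, and missing values are counted as a distinct value, which appears to match the report (nan is listed among the examples). The sample frame reuses three columns from the table extract.

```python
import pandas as pd


def discover(df: pd.DataFrame) -> pd.DataFrame:
    """Return, per column, the share of empty cells and the distinct-value count."""
    summary = pd.DataFrame(
        {
            "pct_empty": df.isna().mean() * 100,
            # dropna=False counts NaN as one distinct value (an assumption).
            "n_unique": df.nunique(dropna=False),
        }
    )
    summary["pct_unique"] = summary["n_unique"] / len(df) * 100
    # Least empty columns first, as in the listing above.
    return summary.sort_values("pct_empty")


sample = pd.DataFrame(
    {
        "name": ["Hellemmes", "Espagne", "Col de Parmnie", "Chile", "Sicile (le)"],
        "is_standard_name": [True, True, True, False, True],
        "lang_iso": [None, None, None, "spa", "fra"],
    }
)
summary = discover(sample)
print(summary)
```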

Type parsing¶

Based on the discovery table above, each column is parsed into its most meaningful type.
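
As an illustration, the parsing step might look like the sketch below. The choice of target types is an assumption drawn from the discovery table (identifiers as nullable integers, timestamps as datetimes, lang_iso as a category); the helper name `parse_types` is hypothetical, and the sample rows are taken from the table extract.

```python
import pandas as pd

ID_COLUMNS = ["pk_named_place_name", "fk_named_place", "creator", "modifier"]
TIME_COLUMNS = ["creation_time", "modification_time"]


def parse_types(df: pd.DataFrame) -> pd.DataFrame:
    """Return a copy of df with identifiers, timestamps and language codes typed."""
    out = df.copy()
    for col in ID_COLUMNS:
        # Nullable integer type: drops the spurious ".0" and tolerates missing values.
        out[col] = out[col].astype("Int64")
    for col in TIME_COLUMNS:
        # Unparseable or missing timestamps become NaT instead of raising.
        out[col] = pd.to_datetime(out[col], errors="coerce")
    out["lang_iso"] = out["lang_iso"].astype("category")
    return out


raw = pd.DataFrame(
    {
        "pk_named_place_name": [152214.0, 152215.0],
        "fk_named_place": [90320.0, 121167.0],
        "creator": [4.0, 3.0],
        "modifier": [4.0, 3.0],
        "creation_time": ["2012-07-13 09:37:12.680", "2012-07-30 10:29:41.330"],
        "modification_time": ["2012-07-13 09:50:09", None],
        "lang_iso": [None, "spa"],
    }
)
parsed = parse_types(raw)
print(parsed.dtypes)
```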

Columns analysis¶

This section reports notable findings for selected columns. The analyses are not exhaustive.
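
For low-cardinality columns such as creator or lang_iso, and for timestamps such as creation_time, a per-column analysis could start from simple frequency counts. The sketch below assumes this approach; the helper names `value_shares` and `rows_per_year` are hypothetical, and the sample values come from the table extract.

```python
import pandas as pd


def value_shares(series: pd.Series) -> pd.DataFrame:
    """Count each distinct value, keeping missing values as their own category."""
    counts = series.value_counts(dropna=False)
    return pd.DataFrame({"count": counts, "pct": counts / len(series) * 100})


def rows_per_year(timestamps: pd.Series) -> pd.Series:
    """Count rows per calendar year; unparseable timestamps are ignored."""
    years = pd.to_datetime(timestamps, errors="coerce").dt.year
    return years.value_counts().sort_index()


creator = pd.Series([4.0, 4.0, 4.0, 3.0, 3.0], name="creator")
creation_time = pd.Series(
    [
        "2012-07-13 09:33:10.300",
        "2012-09-10 14:39:36.470",
        "2012-07-13 09:37:12.680",
        "2012-07-30 10:29:41.330",
        "2010-06-21 08:19:56.000",
    ],
    name="creation_time",
)

creator_shares = value_shares(creator)
yearly = rows_per_year(creation_time)
print(creator_shares)
print(yearly)
```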

fk_abob_napl_name_type¶

creation_time¶

creator¶

lang_iso¶